What does a large deviation look like? 
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Large deviation theory quantifies the occurence of events that deviate from the average behavior 
of a system. Such events arise from non-typical trajectories of the dynamics. In this note we derive 
the time evolution of these rare trajectories. 



I. PROBLEM FOMULATION 



o 

A large deviation calculation typically answers the question: What is the probability to observe, say, a value £ of the 
. entropy production? (or, more accurately, what is the exponential rate 1(0 at which this probability decreases in time) 
<D . 

Here the questions we investigate are the following: What does the dynamics leading to such a deviation £ look 
like? Which trajectories are associated to such deviations? 

We adress these questions in the context of Markov chains and derive the effective dynamics governing the large 
' fluctuations of size £. 

o; 
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^ ' A. Markov chains 

We consider a Markov chain characterized by a transition matrix G = (G^) £ M. NxN on the finite state space fi. 
^ " The probability distribution p = (pi,p2, ■ ■ ■ ,Pn) evolves in discrete time steps according to 



p(n+l)=p(n)G. (1) 

The evolution operator G is stochastic, i.e. it is non-negative (G > 0) and its rows sum to one = !)• 

We assume that the Markov chain is primitive, i.e., there exists an no such that G n ° has all positive entries. This 
guarantees that G has a unique stationary distribution n such that 

tt = ttG. (2) 



. B. Large deviations of the entropy production 

m 

For concreteness, we consider the large deviations of the entropy production. The entropy production is a fluctuating 
quantity measuring the dissipation occurring along a trajectory i — > i\ iy. —>•... — s> i n of the system. It 

takes the form 

sW^lnjng^] . (3) 



vfc=0 ^ + 

At long times, its probability distribution is characterized by the rate function 



n— >oo ri 

An equivalent description is given in terms of the generating function 

1 • I -AS(n) 



1(0= Km --InProb S(n)/n = £ . (4) 



Q(X) = lim — ln(e- A& w) . (5) 
The generating function is related to the largest eigenvalue p(X) of the operator 

(Lx) i . = G}^ i (6) 

as follows: 

Q(A) = -lnp(A). (7) 
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The operator (J6j) is non-negative but not stochastic: ^ 1 when A ^ 0. When A = we recover the original 

evolution operator: L\ = q = G. 

The descriptions in terms of the rate function I and the generating function Q are related through the Legendre 
transform 

Q(A) = min[/(0 + A£] . (8) 

These functions quantify the occurence of events of size £. The goal of this paper is to go a step further and reveal 
their dynamical properties. 

II. LARGE DEVIATIONS ARE CHARACTERIZED BY AN EXTREMAL PRINCIPLE 



We consider the space of stochastic matrices compatible with an irreducible non- negative matrix A = (Ay ) G 
A stochastic matrix P — (Py) £ M. NxN is said to be compatible with A if P satisfies 



We then have the following 

Theorem. Let E^, be the space of stochastic matrices compatible with G. Then 



Q{\) = min 



and ifs stationary probability distribution 



pNxN 



1) P y > if Aij > (9a) 

2) P y = if Ay = . (9b) 



(10) 



Here q is the stationary probability distribution of P (q = qP). 

DEMONSTRATION. Direct application of the Theorem 1.2 of Ref. Q to the operator L\. □ 
Furthermore, the optimal dynamics is given by the following 
Theorem. The equality in U0\) holds for the stochastic dynamics 

P * = ^diag(x A )- 1 L\ diag(z A ) (11) 



1\ = — f ■ 12 



Here X\ > and y\ > are, respectively, the right and left eigenvectors of L\ corresponding to the largest eigenvalue 
p(X). diag(a;) denotes the diagonal matrix with x on its diagonal and y ox denotes the vector (yiX\, yjv^jv)- 

DEMONSTRATION: Insert (JTTJ) and dHJ) into dTOj) and check that we have equality. □ 
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III. INTERPRETATION 



The result (fT0|) has a clear physical interpretation. The term 



is the entropy production observed when fluctuations generate the statistics P. The term 

J[P] = £>Py In (§ L ) (14) 

is the Kullback-Leibler distance between the dynamics P and G. It corresponds to the probability rate to observe 
fluctuations with statistics P in the original dynamics. The generating function at value A thus selects out fluctuations 
that minimize the deviations from the original dynamics along with the A-weighted entropy production. 

The physical meaning of the terms (IT5|) and (fLT| allows the connection with the rate function The optimal 
dynamics P£ generates a dissipation £(A) = D[P£]. Inverting this relation, the rate function is given by 

no = ■ (15) 

This establishes the link between the optimal Markov chains and the large deviations. The typical trajectories of 
the optimal chain P£ 1) lead to an entropy production £(A) and 2) appear at the rate /(£) in the original dynamics. 
We thus conclude that the large fluctuations of size £(A), when observed, appear through the dynamics P\(gy This 
answers our original question on what large devations look like. 

IV. TIME REVERSAL SYMMETRY AND LARGE DEVIATION PROPERTIES 

The symmetry under time reversal implies a 

Fluctuation Theorem. The generating function has the symmetry 

Q(\) = Q(1-X). (16) 

DEMONSTRATION: Direct consequence of the relation L\ = LJ_ X . □ 

How does this symmetry affect the optimal chains? We start by looking at particular cases: 



• A = 0: The optimal dynamics is identical to the original chain, P\ =0 = G, and the generating function 
Q(A = 0) = 0. 

• A = 1/2: The optimal dynamics -Px =1 / 2 satisfies the detailed balance conditions, leading to D = £ = 0. The 
generating function thus takes the value Q(l/2) = /(£ = 0). That is, the optimal dynamics minimizes the 
Kullback-Leibler distance with respect to the original chain, given the detailed balance constraints. 

• A = 1: The optimal dynamics is the time reversal of the original chain, P£ =1 = G R . The time reversal of a 
Markov chain G is defined as 

G R = diagO)- 1 G T diag(ir) (17) 

where it > is the stationary distribution of the chain G. 
More generally, we have the following 

Theorem. The chains P£ and P*_^ are time reversal of each other. 
DEMONSTRATION: Left as an exercise. 



This theorem reveals that the paths leading to a given large deviation and those leading to its opposite one are 
time reversal of each other. 
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V. EXAMPLE: LARGE DEVIATIONS OF A THREE-STATE SYSTEM 

The generating function is given by Q(A) = J(A) + AD(A), where D(X) = D[P*} = £(A) and J(A) = J[P*} (Fig. [Q). 
It satisfies the fluctuation theorem (section [IV]). The effective entropy production D(X) is antisymmetric with respect 
to A = 1/2, where it vanishes due to the equilibrium dynamics (section ITVl) . At A = it takes the value of the average 
entropy production of the original dynamics G. The rate function /(£) is obtained from J (A(£)) (not shown). 




FIG. 1: Generating function and its decomposition in a three-state system. The generating function is determined 
by the rate J and the effective entropy production D of the optimal dynamics P^ ■ The transition probabilities take the values 
P12 = 0.8, Pis = 0.2, P 2 i = 0.1, P 23 = 0.9, P 3 i = 0.7, P 32 = 0.3, and otherwise. 

Figure[5]illustrates the dynamics leading to different rare events, as measured by different values of A or, equivalently, 
of the entropy production £(A). The first row corresponds to the original dynamics G. There is a clear temporal 
structure in the trajectory, with an average positive circulation (i.e., a tendency to go in the direction 1 — > 2 — > 3 — > 
1 — > . . .). This average tendency disappears progressively until we reach A = 1/2, which corresponds to an equilibrium 
dynamics and fluctuations of vanishing entropy production. For A > 1/2, the fluctuations are, on average, going in 
the opposite direction (i.e., in the direction 1 — >• 3 — > 2 — >■ 1 — >• . . .). Finally, the dynamics at A = 1 is the time reversal 
of G. More generally, large deviations of opposite values are generated through a time-reversed dynamics (section 

El). 
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VI. CONCLUSIONS 



Large deviations can be described in terms of an extremal principle over the space of Markov chains. The associated 
optimal chains describe the dynamical behavior of these large deviations. This approach presents neat mathematical 
properties and offers new insights into the dynamical origin of large deviations. The present results readily translate 
to other quantitiers of interest such as the thermodynamic currents. I believe that the knowledge of the paths leading 
to rare events will play an important role in the description and understanding of nonequilibrium systems. 

Disclaimer. This paper is not intended for journal publication. Therefore, it lacks the delusional claims typically 
found in published papers about its (simultaneous) relevance to drug design, nanoelectronics, photonics, spintronics 
or any other fashionable application. Also, everyone can sleep at peace without worrying about missed citations and 
their h- index (and without e- mailing me to complain about them). 



[1] Shi-Feng Shieh, Entropy 13, 2036 (2011). 




FIG. 2: Large deviation dynamics in a three-state system. Each row corresponds to a large deviation of given size (fixed 
A or (). The left-hand side displays the transition matrix of the optimal chains [final state = 1 (blue), 2 (green), 3 (red)]. The 
right-hand side shows the corresponding stochastic trajectories. The parameters take the same values as in Fig. [T] 



